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ABSTRACT 

Dust extinction and reddening are ubiquitous in astronomical observations and are 
often a major source of systematic uncertainty. We present here a study of the cor- 
relation between extinction in the Milky Way and the equivalent width of the Nal D 
absorption doublet. Our sample includes more than 100 high-resolution spectra from 
the Keck telescopes and nearly a million low-resolution spectra from the Sloan Digital 
Sky Survey (SDSS). We measure the correlation to unprecedented precision, constrain 
its shape, and derive an empirical relation between these quantities with a dispersion 
of order 0.15 magnitude in E^-v- From the shape of the curve of growth we further 
show that a typical sight line through the Galaxy, as seen within the SDSS footprint, 
crosses about three dust clouds. We provide a brief guide on how to best estimate 
extinction to extragalactic sources such as supernovae, using the NalD absorption 
feature, under a variety of circumstances. 
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1 INTRODUCTION 

One cannot avoid observing through the dust that permeates 
galaxies. Dust both reddens and absorbs radiation in ways 
that are usually difficult to quantify. These effects depend on 
the amount of dust along the line of sight as well as on the 
dust properties such as chemical composition and grain size 
distribution (see Draine 2003 for a review). Effectively, every 
observation (most notably in the UV and optical regimes) 
suffers from a systematic and usually unknown effect. 

A tool often-used to correct for the extinction and red- 
dening incurred is the strength of the absorption doublet of 
sodium Nal D AA5890, 5896, a well-known tracer of gas, met- 
als, and dust. The doublet absorption strength is generally 
expected to correlate with the amount of dust along the line 
of sight. Richmond et al. (1994) have shown, using 57 high- 
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resolution stellar spectra taken by Sembach et al. (1993), 
that the equivalent widths (EWs) of the individual compo- 
nents of NalD do indeed correlate with the color excesses 
measured for these stars, albeit with a noticeable scatter. 
Munari & Zwitter (1997) add a body of 32 stars observed 
with somewhat lower resolution (R « 16,500 vs. R « 60,000- 
600,000); they constrain the nonlinearity of the relation, fol- 
lowing the curve of growth, and discuss its pitfalls. They 
find that the precision is limited to about SEb-v~ 0.05- 
0.15 mag for a given measurement. They also show how at 
EW > 0.5 A the NaID2 line saturates and the relation flat- 
tens 1 . At low extinctions, and therefore low column densi- 
ties, the EW is supposed to track Eb-v linearly. 



1 We note that MZ97 invert the notations for the two lines. We 
will follow the traditional Fraunhofcr notation, assigning Di to 
the redder line at A5897 and D2 to the bluer, typically stronger, 
line at A5891. 
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This useful correlation in high resolution spectra was 
recently shown not to extend in any practical way to low 
resolution spectra (Poznanski et al. 2011, see also Blondin 
ct al. 2009). While the correlation survives the degradation 
in resolution, the large scatter in the relation is combined 
with confusion due to the blending of the doublet. In addi- 
tion, there can be a variable and unknown continuum con- 
tamination from nearby sources due to variations in seeing 
conditions. 

In this paper we measure the correlation between Nal D 
EW and extinction in the Milky Way (MW) to a significantly 
higher precision than achieved before. We show that indeed 
the correlation is robust based on two different datasets but 
that the functional shape of the correlation differs signifi- 
cantly from what has been derived before. We discuss the 
implications of these differences in terms of the inference 
one can make about the dust content of the MW. We also 
derive a simple empirical law that allows one to estimate an 
extinction based on measured Nal D equivalent widths. 

While the correlation between dust extinction and the 
EW of NalD has been shown to exist, it is of value to see 
how it arises theoretically. 

By definition, 



EW = 




which, at low optical depths r translates to 

EW oc r. (2) 

The optical depth is the column density of the neutral 
sodium, A^Nai, times the absorption coefficient, which we de- 
note as kq. For a given gas-to-dust ratio / g 2d, a sodium frac- 
tion /n 3 (the mass fraction of all the gas that is in sodium) , 
and sodium ionization fraction /i on , we therefore get 

EW OC Ko(l — /ion)/Na/g2dA r dust, (3) 

where Adust is the column density of the dust. 

We next take the definition of the total to selective ex- 
tinction, 



Ay is proportional to N duBt . We can assume any arbi- 
trary function, which we will denote as F. So 

AW st =F(R Y ,E B -v), (5) 

which we can finally combine with Equation 3, to get 

EW (X K (l - /ion)/Na/ g 2dF( J Rv, E B -v). (6) 

Given the large number of proportionality factors, and 
how uncertain (and varying) they are in the MW, it is quite 
astounding that a correlation between EW and Eb-v sur- 
vives at all. Complex physical processes in the ISM fur- 
ther muddy the picture as, for example, the sodium frac- 
tion will be influenced by depletion that in turn depends on 
the dust content. Furthermore, /i on is large, and sensitive to 
photo-ionization as a result of the low ionization potential 



of sodium (5.1 eV). Perhaps what saves the day is a more di- 
rect correlation between metal content and dust, bypassing 
all of the gas-related variables. 

A further complication, which will have significant im- 
plications on the shape of the relation, is that a random 
sight line through a galaxy will often cross more than one 
gas cloud. This allows the EWs of the lines to grow be- 
yond saturation if one does not possess the resolution to 
separate the systems. Even if the absorption lines of the dif- 
ferent systems can be separated, the extinction incurred by 
the different clouds cannot be. MZ97 resolve this issue by 
removing from their sample stars that show multiple absorp- 
tion systems. This effectively assumes that the instrumental 
resolution is better than the velocity dispersions of the sys- 
tems and that those systems that appear single are indeed 
fully resolved. In this paper we sum the contributions from 
different components, even when we can separate them, ef- 
fectively averaging over the typical number of clouds crossed 
by a sight line. 

As wc show below, the correlation does exist, with a 
surprisingly low scatter, yet with a functional structure that 
reflects somewhat the richness of the processes involved. 

In Section 2 we present our samples, which we inter- 
compare in Section 3. In Section 4 we look at the correlation 
between NalD EW and -Eb-v, deriving the empirical tool 
in Section 5. We conclude in Section 6. 



2 DATA AND MEASUREMENT 

Our purpose is to measure the EWs of Nal D and to com- 
pare them to -Eb-v values. For the latter we use the maps 
of Schlegel et al. (1998, hereafter SFD) who measured the 
distribution of dust throughout the sky using far infrared 
emission as a tracer of dust content. Recently Schlafly et al. 
(2010) and Schlafly & Finkbeiner (2011) have found that 
the SFD maps may need to be re-normalized as they over- 
predict extinctions by 14%. These authors also find that 
there are offsets between the northern and southern skies 
as well as higher order corrections in some smaller regions 
of the sky. However, they do not provide improved maps 
yet and claim they will do so when more data are available. 
Since this rescaling is a temporary fix, we do not apply it, 
and note that it does not affect our result in any significant 
fashion. A reader who prefers to apply this scaling can sim- 
ply multiply any color excess value that we derive based on 
the SFD normalization by 0.86. 

In order to measure the narrow sodium absorptions 
we require large numbers of high-resolution observations of 
background sources. We compile data from two different 
catalogs, with quite different properties. One sample con- 
sists of over a hundred high-resolution spectra of quasars 
(QSOs), where the analysis of absorption lines is straight- 
forward. A second sample includes hundreds of thousands of 
low-resolution spectra from SDSS that require more subtle 
methods. The samples complement each other and allow for 
various consistency checks. 
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2.1 High- Resolution Sample 

Over the past two decades many tens of QSOs have been ob- 
served at high resolution with the Keck telescopes for various 
science goals. We use here a sample of 117 QSOs that were 
obtained using the HIRES (43 spectra) and ESI (74 spec- 
tra) cameras. This sample includes only spectra that cover 
the wavelength range of Nal D. We henceforth call this com- 
bined set the 'HIREZ' sample. 

All of the spectra were drawn from the the UCSD/Keck 
Damped Lya Database (Prochaska et al. 2007) and surveys 
of metal-strong damped Lycv systems (Herbert-Fort et al. 
2006) The spectra were reduced with either the MAKEE, 
HIRedux or ESIRedux packages to optimally extract, cali- 
brate, and coadd the spectra. Each spectrum was continuum 
normalized and shifted to the heliocentric vacuum reference 
frame. We refer the reader to those papers for full details on 
data acquisition and reduction. 

We examined each spectrum near the wavelength of 
NalD for the presence of the doublet which we could as- 
sociate with Galactic absorption. Despite the high spectral 
resolution (i? « 40,000-50,000 for HIRES, and R « 6,000- 
9,000 for ESI), the data were occasionally compromised by 
strong Nal D emission and those few spectra were eliminated 
from the analysis. Examples of the spectra that were con- 
sidered is presented in Figure 1. 

For the spectra that were free of excessive sky contami- 
nation, we measured the equivalent width of each member of 
the doublet using boxcar summation and calculated the cor- 
responding error, summing over multiple systems whenever 
they are resolved. Table 1 summarizes these measurements. 
We caution that the error estimates for these measurements 
ignore the systematic error of continuum placement which 
imposes an additional ~ 5 — 10% uncertainty. Due to the 
poorer resolution of the instrument, the ESI measurements 
typically have larger scatter and uncertainty. 

2.2 SDSS Sample 

SDSS observed more than 100, 000 QSOs, and nearly a mil- 
lion galaxies, albeit with low resolution (R ~ 1800) and 
often with quite low signal to noise ratio (S/N). Our sam- 
ple includes all the spectra in the 8th data release of SDSS, 
though we use only spectra with redshifts z > 0.005 to mit- 
igate any contamination from Nal D absorption in the rest- 
frames of the QSOs and galaxies. After cleaning out a few 
spectra of very low quality, we remain with 121,391 QSOs 
and 860, 883 galaxies, nearly one million spectra in total, to 
which we refer as the SDSS sample. 

In order to detect and measure the NalD line in the 
SDSS spectra they must be combined in large numbers (typ- 
ically more than 200; we test this throughout this paper). We 
attempted to measure the EWs on individual SDSS spectra, 
so as not to resort to binning. However for the vast majority 
of spectra we could not get a double gaussian fit to converge 
and give a useful measurement. We therefore create high- 
S/N spectra by means of various stacking schemes. First 
we linearly interpolate every spectrum to an identical and 
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Figure 1. Representative samples of Keck/ESI (left) and 
Keck/HIRES QSO spectra covering the NalD doublet absorp- 
tion due to the Milky Way. Each spectrum has been continuum 
normalized. These QSO spectra show occasional absorption from 
extragalactic systems in the wavelength region of interest (e.g. at 
A w 5894. 5A) toward J1131+6044. Such features were carefully 
identified and ignored in our analysis. Blue lines are the la error 
arrays. 



denser wavelength grid in the range 5800 - 6000 A. We then 
divide the spectra by a low order polynomial in order to re- 
move continua of any shape and to normalize flux levels. We 
group them with different binning rules and last, combine 
them using median filtering. Since the Nal D lines are often 
weak and always blended, we measure the EW by fitting 
a double Gaussian to the relevant wavelength region. This 
is a two step process where we first fit for the stronger D2 
line (restricting our fit to wavelengths smaller than 5894 A; 
fitting for the line center, width, and amplitude), we then 
subtract the best fit Gaussian from the spectrum, and finally 
similarly fit for the D\ line (at A > 5984 A). We find that 
this method robustly recovers the doublet at all detectable 
strengths. 

In Figure 2 one can see the effect of combining all the 
SDSS spectra, though only 10 random ones are shown in 
the background for clarity, as well as the 5 and 95 percentile 
curves. Despite the large amplitude noise in the spectra (in 
fact at our wavelength of interest the noise is even greater 
due to residual contamination from atmospheric Nal D lines) 
we recover a strong doublet at the NalD wavelengths. The 
resulting spectrum has the theoretical resolution of the 
SDSS spectrograph, of order 2.5 A at this wavelength. 
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Figure 2. We median combine all of the (normalized) SDSS 
spectra (nearly a million), the resulting spectrum with a clear 
detection of the Nal D doublet is plotted with a thick black line. A 
random subset of ten input spectra are shown in the background 
in various colors, in order to demonstrate their noise properties. 
In thick red lines we show the bottom 5 and top 95 percentile 
curves. 



3 HIREZ VS. SDSS: COMPARING THE 
SAMPLES 

By construction, both samples are typically at high Galactic 
latitude and low Galactic extinctions, as derived from the 
SFD maps. Figure 3 shows the respective distributions in 
Eb-v- The distributions match well (which we confirm by 
a Kolmogorov-Smirnov test), and mostly probe the region 
of _Eb-v< 0.1. However within the gargantuan number of 
SDSS spectra there is a non negligible fraction at higher 
extinctions, up to Eb-v~ 6. 

In order to assess whether the Nal D EWs for both sam- 
ples are consistent and to determine the scatter in the mea- 
surements, we compare the samples directly as follows. For 
every set of coordinates of a HIREZ object we would like 
to stack SDSS spectra of targets in a similar position in the 
sky. The maximum angular separation is a compromise be- 
tween two effects. A larger separation would improve the 
S/N in the combined spectrum, at the cost of larger scat- 
ter due to variance introduced by averaging over too differ- 
ent sight lines. Accordingly, a smaller separation will reduce 
significantly the number of sources, and limit our ability to 
measure the EW. 

Requiring at least 200 SDSS objects per bin (otherwise 
the combined spectra are too noisy), and trying a range of 
thresholds, we measure the EW of the (stronger) Di line 
and compare the EWs of the HIREZ spectra versus stacked 
SDSS EWs, and perform a linear fit. The slope between the 
two samples (which should be of order unity) is near 0.9 
for maximal separations (i.e., the diameters of the circular 
regions around each HIREZ object) of 2 — 4°. Greater sepa- 
rations show a larger offset, as expected, up to about 20%. 



This scale is consistent with our measurement of the typi- 
cal separation on the sky over which the SFD maps change 
significantly: for our fields (effectively the SDSS footprint) 
a standard deviation of 0.01 mag in Eb-v occurs within a 
distance of 2° (see however some works that measure vari- 
ations on much smaller scales: Frail et al. 1994; Meyer & 
Lauroesch 1999; Andrews et al. 2001). Quite independently 
of distance, the systematic uncertainty (i.e., the added un- 
certainty required for x 2 P er degree of freedom (d.o.f) ~ 1) 
is about 70 mA. Figure 4 shows an example for a separa- 
tion threshold of 4° , though plotted in term of the residuals 
between the samples, for clarity. 

The procedure above supplies us with two numbers. 
First, our systematic uncertainty is of order 70 mA. A few 
sources contribute to this number: variations on small scales 
of the Nal D absorption, the effects of binning, sky subtrac- 
tion errors, and perhaps underestimated measurement un- 
certainties. Second, we find a systematic shift, where mea- 
surements on the SDSS spectra are consistently underesti- 
mated by about 10%. This could be expected because of 
the following effect. As we bin and combine multiple SDSS 
spectra with a skewed distribution of extinctions, the un- 
avoidable pollution of one bin by members from adjacent 
bins is asymmetric. The dominant population contaminates 
the remainder more significantly than vis versa. Since most 
of our spectra suffer from low-extinction and small EW, any 
contamination will tend to dampen the EW measured. 

This explains why at large threshold distances the bias 
grows, as more and more different (and uncorrelated) sight 
lines are combined. This also entails a bias when we bin spec- 
tra according to their Eb-v values, as derived from the SFD 
maps. Since these maps have a typical scatter of 0.01 mag 
we expect a bias of 10%. We examine this bias further in 
Section Al. In order to correct for this bias, we add a 10% 
correction to every EW value derived from SDSS stacked 
spectra. This correction is small enough to have little effect 
on our qualitative and quantitive findings. 



4 EW VS. Eb-v FROM SFD MAP 

We next turn our attention to comparing the Nal D EWs as 
derived from both datasets to the extinction posited by the 
SFD maps. 



4.1 HIREZ sample 

Figure 5 shows the EWs as a function of Eb-v for the 
HIREZ sample. There is a clear correlation between those 
two values (Pearson's correlation coefficients of 0.7 and 0.6 
for the D2 and Di lines respectively, with p values smaller 
than 10 -10 ). The correlation is more readily apparent to the 
eye when one bins the data over Eb-v, where the bin edges 
are set to have about 20 measurements per bin. While ro- 
bust, the correlation does not follow the trend we expect 
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Figure 3. -Eb— v distribution for our samples of spectra, SDSS 
QSOs in black, SDSS galaxies in blue, and HIREZ in red. Clearly, 
these samples are similar, and mostly probe the low extinction 
range, however the large number of SDSS spectra does include 
more highly reddened objects. 
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Figure 4. We compare the EWs derived for both samples at a 
similar position on the sky. Plotted here are the residuals between 
the measured EWs of the stronger D2 line for a maximal sepa- 
ration of 4° between binned SDSS spectra. We find a systematic 
uncertainty a ays of order 70 mA, and that stacking introduces a 
bias of about 10% as discussed in the text. Residuals in black 



from the curves of either R94 or MZ97 2 . If we assume a lin- 

2 Note that MZ97 do not actually fit for the NalDi correlation. 
They discuss the fact that Nal D2 should be twice as strong due to 
ratio of oscillator strengths but that the lines should get closer in 
strength as the stronger line saturates. They find that their data 
corroborates this, asymptotically converging toward a ratio of 1.1. 
We therefore simulate the MZ97 NalDi curve by modulating the 
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Figure 5. NalD EW vs. E B -v for the HIREZ sample (bot- 
tom: the Di line, top: D2). The correlation is most apparent for 
the binned data or for the HIRES data, since the ESI EWs are 
much noisier. However the correlation is significant and consis- 
tent across the subsamples. Our best fit linear curve (blue line) 
clearly diverges from the derivations of MZ97 (dashed lines) and 
R94 (solid lines). 



ear relation between NalD EW and Eb-v in that range of 
extinctions, our best fit slope is about twice larger than the 
slope of the MZ97 curve. The systematic scatter is of order 
50 mA. 



4.2 SDSS sample 

We next turn our attention to the SDSS sample, where we 
bin the spectra over similar Eb-v values as derived from the 
SFD maps. Here again the number and sizes of bins were not 
determined arbitrarily. We tested various procedures that 
all gave consistent results. For 0.02 <£"b-v< 0.3, where 
we have an abundance of spectra and signal, rather than 



NaID2 best fit of MZ97 with a function that starts at 2 and 
exponentially reaches a value of 1.1. 
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fixing the sizes of the bins, we fix the number of objects 
per bin. The number of bins for this extinction range was 
determined by varying it from 5 to 200, every time fitting a 
quadratic function to the resulting EW vs. Eb-v- We found 
that the results are always consistent and stable and do not 
depend on the number of bins. However, outside this range of 
extinctions, we only have about 1500 spectra at Eb-v> 0.3, 
spanning a large range of extinctions - going to Eb-v~ 6 
(with only 54 in the range 2 <£"b-v< 6). For these we 
therefore set additional bin edges at color excesses of 0.5, 
1, and 2. All the spectra at _Eb-v< 0.02, where the EW is 
very small and difficult to measure, are stacked in a single 
bin. We present below results using a total of 50 bins that 
seem to well sample the relation. 

Figure 6 shows the 50 binned SDSS spectra, as well as 
the fit to the NalD lines, sorted from low extinction (top) 
to high extinction (bottom). Clearly the spectra have ex- 
tremely high S/N, deteriorating as we reach the highest bins 
where objects are scarcer. The doublet is well detected, and 
apparently the double Gaussian fit works well, and recovers 
the line shapes. Another obvious result is the clear correla- 
tion between the EW of both lines and extinction. One can 
follow the curve of growth by eye, as D2 gets stronger first, 
stalls and is nearly caught up with by the weaker D\ line 
that saturates later. 

In Figures 7 and 8 we show the EW vs -Eb-v at low 
extinctions, and over the full range that we probe, respec- 
tively. There are quite a few remarkable features in these 
plots. First, the correlation is very tight, with a similar slope 
at low extinction to the one derived from the HIREZ sample. 
This is true for both Nal D lines. Second, similarly to what 
we find for the HIREZ sample, but even more so here, there 
is a very clear discrepancy between the curves derived by 
R94 or MZ97 and our findings. Last, while the correlation is 
quite linear up to extinctions of about 0.06, it then flattens, 
reaching a first plateau around 350 mA (D2; 270 mA for Di), 
but with a much more striking saturation feature occurring 
at E B -v> 0.7, and an EW near 1000mA (D 2 ; 800mA for 
Di), as readily apparent in Figure 8. 

If we combine all the SDSS spectra in one bin the result- 
ing spectrum has EWs of 160 mA and 90 mA for the Nal D 
lines. At a median -Eb-v of 0.026 this is consistent with the 
results of the 'less aggressively binned' data. 




5870 5880 5890 5900 5910 5920 5930 

Wavelength [A] 

Figure 6. Binned SDSS spectra sorted from low to high ex- 
tinction (top to bottom), superposed with the best fitting dou- 
ble Gaussian (black). The strong correlation between the EW of 
Nal D and extinction is clearly apparent to the unaided eye. 



5 DISCUSSION 

5.1 Comparison to previous results 

Since the data and analysis of MZ97 are more relevant at 
the extinction ranges we probe, in addition to their curves 
being less discrepant from our data, we hereafter focus on 
these rather than on the results of R94. 

Our data seem to point to a rather simple multiplicative 
re-scaling of the MZ97 curves. We perform joint fits of our 
two (binned) samples to the best fitting MZ97 curves for 
both NalD lines and find that models where EW-> niEW 
or Eb-v - ► ri^Es-v bring the MZ97 curves to our data (or 



vis versa). A systematic uncertainty of order 30 mA suffices 
to reach x 2 /dof = 1. 

The best fit requires the EWs to be multiplied by about 
1.5. Alternatively, to Eb-v we need to apply a factor of 2. 
This result is stable, whether we fit only one sample or the 
other, one absorption feature or the other. A value of 1 for 
the stretch (i.e., no change) is > 5a away from our best fit. 

We find it unlikely that MZ97 have underestimated 
their EW measurements by a factor of 1.5, considering the 
fact that they span such large extinctions (up to Eb-v = 
1.5), with commensurately large EWs (up to 600mA). In 
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Figure 7. NalD EW vs. £?B-V f° r t- ne SDSS sample (squares; 
bottom: Di, top: D2, at low extinctions. While there is a clear 
correlation consistent with the one derived from the HIREZ sam- 
ple (blue line), it again diverges from the curves derived by MZ97 
(dashed lines) and R94 (solid lines), with an intricate nonlinear 
structure. 



view of the strength of the signal, in two independent 
datasets with different methods, we find it equally improb- 
able that we overestimated ours by that same amount. 

Alternatively MZ97 may have overestimated their 
Eb-v values by a factor of 2, or that the SFD extinctions 
we use are as severely underestimated. Here again neither 
explanation is very attractive. 

The only remaining possibility is that somehow we are 
not comparing 'apples to apples', by probing a different ISM 
from the one MZ97 probe on average. Restricting ourselves 
initially to low optical depths, we find a much steeper slope 
than the one observed by MZ97. This could be explained if 
we allow a systematic change between the samples in one of 
the parameters of our Equation 6, namely, «o, /ion, /ns, / g 2d, 
or Ry. Obviously all these factors are not well measured, 
known, or understood, except for some very specific envi- 
ronments. However, we can explore qualitative arguments 
on the amount and direction of change we can expect when 




E B -v [mag] 

Figure 8. Same as Figure 7, for the full range of extinctions. 

moving from the low galactic latitude, disk environment of 
the MZ97 stars to the high galactic latitude, bulge-like gas 
and dust we that we are mostly observing. 

Perhaps the simplest parameter is Ry. In the denser 
gas- and dust-rich star forming environments of the MZ97 
stars we expect larger grains to be more easily built. Larger 
grains lead to larger values of Rv and therefore this effect 
would work in a direction opposed to what we measure. 

An additional dependency is similarly ruled out from 
simple arguments. The Galactic metallically gradient would 
reduce /n 3 in our sample when compared to more central 
regions. This effect would therefore work in the wrong di- 
rection. 

The three remaining parameters are the ionization frac- 
tion, variations in the gas-to-dust ratios, or changes in the 
depletion of sodium. The ionization fraction will depends on 
the radiation field, which in turn can vary significantly in 
different regions. In the denser disk of the galaxy we expect 
larger UV fluxes, and a greater ionization fraction. Accord- 
ing to Savage & Sembach (1996), one finds a larger gas-to- 
dust ratio in the halo. Last, in dust rich environments we 
perhaps expect greater depletion of the sodium from the gas 
to the solid phase. All of these effects will make fjy a greater 
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for our sample, so could explain what we measure. However, 
they would more markedly affect high extinction regions, 
which is not what we measure - an overall stretch of the 
correlation. There is also evidence that sodium depletion is 
rather constant and independent of environment (Phillips 
et al. 1984). 

We checked whether the correlation depends on the 
Galactic latitude by breaking the SDSS sample to two sub- 
samples, at absolute Galactic latitudes above or below 50°. 
We find no measurable difference between the two correla- 
tions. While one does expect differences in the dust distri- 
bution and properties as a function of galactic latitude, it 
seems that binning the data in -Eb-v erases the distinction 
between the environments. 

The answer probably lies in the different treatment of 
multiple line systems. MZ97 make a point of using solely 
those stars towards which they resolve only single systems, 
and they reject from their fit stars that show more than one 
set of absorption lines. As expected, the stars they reject 
tend to have larger EW since more clouds allow for more 
absorption to happen at a different velocity, circumventing 
saturation. However, our sample lies outside the Galaxy, and 
our lines of sight cross multiple clouds on average. This is 
also the most likely scenario when one observes an extra- 
galactic source and wants to correct for host (or MW) ex- 
tinction. This, at least qualitatively, explains why we tend 
to measure a steeper correlation. Indeed our data and best 
fit relation seem consistent with the handful of multiple-line 
systems that MZ97 reject from their analysis. 

Whether individual systems are resolved or not depends 
on the instrumental resolution as well as the velocity disper- 
sion in the clouds. Many of our spectra observed with the 
HIRES spectrograph clearly show multiple systems, often 
separated in velocity by a few kms -1 , while most of the 
rest have complex line shapes that are neither Gaussian nor 
Lorentzian. This can be seen in Figure 1. We assume the 
same applies for the ESI and SDSS spectra that do not offer 
the resolution or the S /N to assess this. For that reason and 
for consistency between the samples, we add up the EWs 
whenever multiple systems are resolved in the HIREZ sam- 
ple. 

This also sheds light on the functional form of the cor- 
relation we find. The first 'plateau' in the correlation can be 
attributed to saturation from single clouds. For a Doppler 
parameter b « 5kms^\ the NalD doublet should indeed 
begin to saturate at w 200 mA. 

However, most of our sight lines across the MW will 
traverse multiple clouds. These individual systems will not 
be resolved in our SDSS spectra, and can contribute to the 
measured EW, allowing the curve of growth to continue ris- 
ing past saturation. The second, stronger saturation, that 
occurs at EWs near 1000mA (D 2 ; 800mA for Di) might 
therefore be the result of all these clouds saturating. Since 
our method averages over multiple sight lines, the ratio be- 
tween the two saturation levels is the average number of 
such clouds along a typical line of sight, about three. 
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Figure 9. log 10 (i?B— v) vs - NalD EW for both samples merged. 
D2 top, Di middle, and their sum in the bottom panel. The cor- 
relation can be well approximated with a linear fit when one takes 
the logarithm of the color excess. The light blue band marks the 
1(7 uncertainty which is dominated by the intrinsic scatter in the 
correlation. 



5.2 Empirical Relation and Practical Guide 

We aim to provide the reader with the simplest yet precise 
description of the correlation, in order to derive extinctions 
from measured NalD lines. Having tested various functions 
to fit the data, we found that the logarithm of the color ex- 
cess is, within the uncertainties, quite linear with respect to 
the equivalent widths. This is shown in Figure 9, where the 
relations to both lines (and their sum, in case they are not 
resolved) are shown. Note that for convenience we switch the 
axes when compared to previous figures, and use A units for 
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the EW rather than mA. The uncertainties in the derived 
parameters are dominated by the systematic uncertainty of 
70 mA which we propagate. This should encompass the nat- 
ural scatter in the correlation and entails an uncertainty in 
derived color excesses of 30-40%. The relation for the D2 
line is: 

log 10 (£ B -v) =2.16 x EW(D 2 ) - 1.91 ± 0.15, (7) 
for the Di line: 

log 10 (£ B -v) =2.47 x EW(Di) - 1.76 ± 0.17, (8) 
and for a measurement of both lines combined: 

log 10 (£ B -v) =1-17 x EW(Di + D 2 ) - 1.85 ± 0.08. (9) 

There are a noteworthy caveats regarding these rela- 
tions. First, for Galactic (or very low redshift) sources the 
contamination from atmospheric sodium might be difficult 
to remove especially in lower-resolution spectra. 

Second, if there are indications that the extinction of 
a given source is dominated by circumstellar dust rather 
than a diffuse interstellar component there is no guarantee 
that the extinction will follow the relation we derived. Such 
dust may have a different composition, gas-to-dust ratio, etc. 
Similarly, our derivation is based on the MW, and therefore 
implicitly assumes an average R v =3.1. This may not apply 
in Galactic regions or external galaxies where one expects a 
different extinction law, with different dust properties. 



6 CONCLUSIONS 

One would expect wide variations between environments 
in the parameters that link the column density of neutral 
sodium and the absorption and scattering properties of dust. 
Yet, we have found a strong correlation between the EW 
of NalD and dust extinction with a typical scatter below 
100 mA. 

The relation has a functional shape that differs from 
previous determinations. We find that this is likely due to 
the fact that previous studies have only considered stars be- 
hind single dust clouds, while extragalactic sight lines will 
usually cross a number of clouds. Our measurement is the 
most precise to date, and should serve as a useful tool to cor- 
rect for the effects of dust on the observations of extragalac- 
tic sources, whenever high-resolution spectroscopy exists, or 
can be obtained. This has already been accomplished for 
the case of SN2012aw (Van Dyk et al. in preparation) where 
extinction estimates show that the supernova suffers from 
little extinction, while the progenitor star was surrounded 
by a significant amount of dust, showing that the supernova 
has destroyed the dust surrounding it. 

We further have studied the shape of the relation, which 
sheds light on the structure of the MW. We show for example 
that within the SDSS footprint, i.e., mostly outside the disk 
of the Galaxy, a typical sight line crosses about three dust 
clouds within the MW. 



ACKNOWLEDGMENTS 

In addition to E. Schlafly, our thoughtful and swift ref- 
eree, we thank the following (alphabetical, probably non- 
exhaustive) list of people for useful discussions, ideas, and 
comments on the manuscript: A. V. Filippenko, O. Gnat, A. 
Loeb, D. Maoz, AA. Miller, P.E. Nugent, E.O. Ofek, JA. 
Rice, B.P. Schmidt, M. Sullivan, B. Trakhtenbrot. 

Some of the data presented herein were obtained at the 
W.M. Keck Observatory, which is operated as a scientific 
partnership among the California Institute of Technology, 
the University of California and the National Aeronautics 
and Space Administration. The Observatory was made pos- 
sible by the generous financial support of the W.M. Keck 
Foundation. The authors wish to recognize and acknowl- 
edge the very significant cultural role and reverence that the 
summit of Mauna Kea has always had within the indigenous 
Hawaiian community. We are most fortunate to have the op- 
portunity to conduct observations from this mountain. 

Some the data presented were obtained from the SDSS 
III archive. Funding for SDSS-III has been provided by 
the Alfred P. Sloan Foundation, the Participating Insti- 
tutions, the National Science Foundation, and the U.S. 
Department of Energy Office of Science. The SDSS-III 
web site is http://www.sdss3.org/. SDSS-III is managed 
by the Astrophysical Research Consortium for the Par- 
ticipating Institutions of the SDSS-III Collaboration in- 
cluding the University of Arizona, the Brazilian Partici- 
pation Group, Brookhaven National Laboratory, Univer- 
sity of Cambridge, Carnegie Mellon University, Univer- 
sity of Florida, the French Participation Group, the Ger- 
man Participation Group, Harvard University, the Insti- 
tuto dc Astrofisica de Canarias, the Michigan State/Notre 
Dame/JINA Participation Group, Johns Hopkins Univer- 
sity, Lawrence Berkeley National Laboratory, Max Planck 
Institute for Astrophysics, New Mexico State University, 
New York University, Ohio State University, Pennsylva- 
nia State University, University of Portsmouth, Princeton 
University, the Spanish Participation Group, University of 
Tokyo, University of Utah, Vanderbilt University, University 
of Virginia, University of Washington, and Yale University. 



REFERENCES 

Andrews S. M., Meyer D. M., Lauroesch J. T., 2001, ApJ, 
552, L73 

Blondin S., Prieto J. L., Patat F., Challis P., Hicken M., 
Kirshner R. P., Matheson T., Modjaz M., 2009, ApJ, 693, 
207 

Burstein D., Heiles C, 1982, AJ, 87, 1165 
Draine B. T., 2003, ARA&A, 41, 241 

Frail D. A., Weisberg J. M., Cordes J. M., Mathers C, 
1994, ApJ, 436, 144 

Hartmann D., Burton W. B., 1997, Atlas of Galactic Neu- 
tral Hydrogen 



© 0000 RAS, MNRAS 000, 000-000 



10 Poznanski et al. 



Herbert-Fort S., Prochaska J. X., Dessauges-Zavadsky M., 
Ellison S. L., Howk J. C, Wolfe A. M., Prochter G. E., 
2006, PASP, 118, 1077 
Meyer D. M., Lauroesch J. T., 1999, ApJ, 520, L103 
Munari U., Zwitter T., 1997, A&A, 318, 269 
Peek J. E. G., Graves G. J., 2010, ApJ, 719, 415 
Phillips A. P., Pettini M., Gondhalekar P. M., 1984, MN- 
RAS, 206, 337 

Poznanski D., Ganeshalingam M., Silverman J. M., Filip- 

penko A. V., 2011, MNRAS, 415, L81 
Prochaska J. X., Perley D. A., Modjaz M., Bloom J. S., 

Poznanski D., Chen H., 2007, GRB Coordinates Network, 

6864, 1 

Richmond M. W., Treffers R. R., Filippenko A. V., Paik 
Y., Leibundgut B., Schulman E., Cox C. V., 1994, AJ, 
107, 1022 

Savage B. D., Sembach K. R., 1996, ARA&A, 34, 279 
Schlafiy E. F., Finkbeiner D. P., 2011, ApJ, 737, 103 
Schlafly E. F., Finkbeiner D. P., Schlegel D. J., Juric M., 

Ivezic Z., Gibson R. R., Knapp G. R., Weaver B. A., 2010, 

ApJ, 725, 1175 
Schlegel D. J., Finkbeiner D. P., Davis M., 1998, ApJ, 500, 

525 

Sembach K. R., Danks A. C, Savage B. D., 1993, A&AS, 
100, 107 



© 0000 RAS, MNRAS 000, 000-000 



Sodium & Dust 11 

Table 1. ESI and HIRES MEASUREMENTS 
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Table 1 (cont'd) 



QSO 


RA 


DEC 


NaID 2 EW 


NalDi EW 




(deg) 


(deg) 


(mA) 


(mA) 
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35.1 ±29.7 
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Table 1 (cont'd) 
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PKS1354-17a 


209 


2753 


-17.7339 


553.1 ±74.8 


203.2 ±39.9 


J141030+511113a 


212. 


.6275 


51.1872 


71.4 ±33.1 


41.3 ± 19.1 


J1435+5359 


218 


.7521 


53.9983 


75.8 ±7.8 


21.3 ± 5.9 


J1541+3153a 


235. 


.4727 


31.8917 


233.0 ± 12.2 


136.6 ± 15.7 


J1552+4910 


238 


.1412 


49.1689 


136.3 ±20.4 


52.6 ± 16.8 


J1558-0031 


239. 


5423 


-0.5222 


403.8 ± 15.9 


300.1 ± 14.2 


J1604+3951a 


241. 


.0582 


39.8561 


134.1 ± 11.0 


59.7 ± 14.6 


Q1759+75T 


269. 


.4433 


75.6544 


361.2 ± 14.5 


223.9 ± 13.7 


Q2230 


338. 


.1471 


2.7986 


318.1 ± 25.9 


240.6 ±44.7 


Q2233+13 


339. 


.0800 


13.4389 


388.8 ± 30.6 


311.5 ±27.1 


FJ2334-09 


353. 


.6935 


-9.1367 




207.1 ±25.9 


Q2343+12 


356. 


.6176 


12.8167 


299.6 ± 5.4 


201.3 ±7.4 


Q2344 


356. 


.6908 


12.7583 


198.6 ± 16.4 


144.2 ± 18.8 
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APPENDIX A: FURTHER TESTS AND 
EXPERIMENTS 

In order to resolve the discrepancy between our measure- 
ment of the correlation and previous efforts, we have at- 
tempted a variety of changes, some of which are described 
below. Note that none of these changed our conclusions. 

The Eb-v values of MZ97 are based on fitting stellar 
models. We tried replacing them with corresponding values 
from SFD at the positions of their stars. As expected, since 
these are nearby stars and not all the way across the Galaxy, 
the SFD values are typically much higher, and the disagree- 
ment between our results and theirs becomes greater. In ad- 
dition the scatter increases significantly indicating that the 
original values were far more appropriate. 

We have attempted correcting the SDSS spectra for the 
velocity of the ISM using HI maps from Hartmann & Burton 
(1997). Every spectrum was de-redshifted by the median 
velocity we measured on its line of sight. This yielded no 
noticeable change, which is not surprising since the mean 
velocity in our fields is llkms -1 which amounts to sub A 
noise, and only a handful of targets are along directions with 
mean velocities greater than 50kms _1 . For simplicity we do 
not correct for these velocities throughout the paper. 

Another tracer of gas and dust is the KI absorption 
line at A7699. Repeating the procedure for the SDSS spectra 
around this wavelength we find that we cannot detect this 
line. This is consistent with the fact that while for NalD 
the weakest EWs we can measure are near 60 mA, the EWs 
of KI should be ~ 30 mA according to MZ97. 

We attempted replacing the SFD values with the pre- 
viously often- used maps of Burstein & Heiles (1982). While 
the resulting slope in EW vs Eb-v is indeed shallower, it is 
only due to the increased noise in the measurements. Basi- 
cally these maps are consistent with being a random reshuf- 
fling of the SFD maps with a dispersion of order 0.04 mag. 
We similarly tried using the Peek & Graves (2010) map, 
that refines the SFD map based on SDSS galaxy colors. We 
find a negligeable improvement in the agreement, but overall 
our departure from the MZ97 result is not resolved by their 
corrections. Again we do not use corrections from Peek & 
Graves (2010) for the sake of simplicity. 

Al A closer look at biases 

As discussed in Section 3, the binning of the SDSS spectra 
introduces a bias, due to the extremely uneven population 
with a significant overabundance of low-extinction/low-EW 
spectra. The amount of bias depends on the prevalence of er- 
roneous assignments to a given bin, which in turn depends 
on the precision of the SFD maps. Using simulations, we 
determined independently how much noise is present in the 
maps and how much bias need be corrected for in our mea- 
surements. 

We picked a wavelength range to introduce a fake ab- 
sorption line (A5870), randomized a line width similar to 
those we measure for the NalD lines, assumed a range of 
correlation slopes, and varied the amount of noise we add to 



the Eb-v values (we take a value of 100 mA for the intrinsic 
dispersion in the correlation, as corroborated by our other 
tests). Regardless of the input slope, after we bin the spec- 
tra and apply the same procedure we used for the real data, 
the slope we recover is between 0.89 (no Eb-v noise) and 
0.65 (o"e (b _ v) = 0.1) times the 'real' slope. Peek & Graves 
(2010) find that the maps they derive deviate from the SFD 
maps by less than 0.003 mag typically, though they do find 
deviations of order 0.045 mag is certain areas. Therefore the 
12% bias we obtain at o"e (b _ v) = 0.01 seems the most ap- 
propriate. This is consistent with our finding in Section 3, 
and our subsequent application of a 10% correction. 
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